Speech analysis by subspace methods of spectral line estimation
نویسندگان
چکیده
Over frames of short time duration, filtered speech may be described as a finite linear combination of sinusoidal components. In the case of a frame of voiced speech the frequencies are considered to be harmonics of a fundamental frequency. It can be assumed further that the speech samples are observed in additive white noise of zero mean, resulting in a standard signal-plus-noise model. This model has a nonlinear dependence on the frequencies of the sinusoids but is linear in their coefficients. We use subspace line spectral estimation methods of Pisarenko and Prony type to estimate the frequencies and use the results in voiced-unvoiced classification and pitch estimation, followed by analysis of the speech waveform into its sinusoidal components.
منابع مشابه
Second order statistics spectrum estimation method for robust speech recognition
A second order statistics spectrum estimation (SOSSE) method for speech enhancement is presented. DFT amplitude spectral components of noisy signal are assumed to be random values. Upon first and second order statistic values estimation of noise-only spectrum, an enhancement of noisy signal spectrum was performed. As a reference, a fast discrete cosine transform based signal subspace (FDCTSS) m...
متن کاملSignal subspace methods for speech enhancement
This thesis focus on the theory analysis and algorithm aspects of signal subspace methods used for speech enhancement in digital speech processing The problem is approached by initially performing an analysis of subspace principles applied to speech signals in order to characterize the usefulness of de ning a signal subspace for this application The theory is formulated by means of the singular...
متن کاملA Brief Survey of Speech Enhancement1
We present a brief overview of the speech enhancement problem for wide-band noise sources that are not correlated with the speech signal. Our main focus is on the spectral subtraction approach and some of its derivatives in the forms of linear and non-linear minimum mean square error estimators. For the linear case, we review the signal subspace approach, and for the non-linear case, we review ...
متن کاملOptimized subspace weighting for robust speech recognition in additive noise environments
Signal Subspace (SS) based speech enhancement techniques obtain signi cant additive-noise reduction by altering the singular value spectrum of the speech observation matrix. Among the class of di erent possible SS weighting strategies, the Minimum Variance (MV) estimation method substantially increases the speech recognition accuracy in additive noise environments, outperforming the widely used...
متن کاملBias considerations for minimum subspace noise tracking
Speech enhancement schemes rely generally on the knowledge of the noise power spectral density. The estimation of these statistics is particularly a critical issue and a challenging problem under non-stationary noise conditions. With this respect, subspace based approaches have shown to allow for reduced estimation delay and perform a good tracking vs. final misadjustment tradeoff [2, 3]. One k...
متن کامل